First-Order Algorithm with O(ln(1/ε)) Convergence for ε-Equilibrium in Two-Person Zero-Sum Games
Authors
Abstract
We propose an iterated version of Nesterov’s first-order smoothing method for the two-person zero-sum game equilibrium problem $\min_{x \in Q_1} \max_{y \in Q_2} x^{\mathsf{T}} A y = \max_{y \in Q_2} \min_{x \in Q_1} x^{\mathsf{T}} A y$. This formulation applies to matrix games as well as sequential games. Our new algorithmic scheme computes an $\epsilon$-equilibrium to this min-max problem in $O\left(\frac{\|A\|}{\delta(A)} \ln(1/\epsilon)\right)$ first-order iterations, where $\delta(A)$ is a certain condition measure of the matrix $A$. This improves upon the previous first-order methods, which required $O(1/\epsilon)$ iterations, and it matches the iteration complexity bound of interior-point methods in terms of the algorithm’s dependence on $\epsilon$. Unlike interior-point methods, which are inapplicable to large games due to their memory requirements, our algorithm retains the small memory requirements of prior first-order methods. Our scheme supplements Nesterov’s method with an outer loop that lowers the target $\epsilon$ between iterations (this target affects the amount of smoothing in the inner loop). Computational experiments in both matrix games and sequential games show that a significant speed improvement is obtained in practice as well, and that the relative speed improvement increases with the desired accuracy (as suggested by the complexity bounds).
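To make the notion of an ε-equilibrium concrete, the following minimal sketch checks it for the matrix-game case only, assuming that Q1 and Q2 are probability simplices (an assumption; the abstract also covers sequential games). It is not the paper's algorithm: it only evaluates the saddle-point gap that such an algorithm drives below ε. The function names `saddle_point_gap` and `is_eps_equilibrium` are hypothetical and chosen for illustration.

```python
def saddle_point_gap(A, x, y):
    """Return max_{y'} x^T A y' - min_{x'} x'^T A y.

    Assumes x and y are probability vectors (simplex case), so the inner
    max and min are attained at pure strategies.
    """
    m, n = len(A), len(A[0])
    # x^T A: value of each pure column strategy against the mixed strategy x
    xA = [sum(x[i] * A[i][j] for i in range(m)) for j in range(n)]
    # A y: value of each pure row strategy against the mixed strategy y
    Ay = [sum(A[i][j] * y[j] for j in range(n)) for i in range(m)]
    return max(xA) - min(Ay)


def is_eps_equilibrium(A, x, y, eps):
    """True if (x, y) is an eps-equilibrium, i.e. the gap is at most eps."""
    return saddle_point_gap(A, x, y) <= eps


# Rock-paper-scissors payoff matrix (illustrative data, not from the paper)
RPS = [[0, -1, 1],
       [1, 0, -1],
       [-1, 1, 0]]
uniform = [1 / 3, 1 / 3, 1 / 3]
pure_first = [1.0, 0.0, 0.0]
```

In this example, `is_eps_equilibrium(RPS, uniform, uniform, 1e-9)` holds because uniform play is the exact equilibrium, whereas pairing `pure_first` with `uniform` leaves a gap of 1.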
Related resources
First-Order Algorithm with O(ln(1/ε)) Convergence for ε-Equilibrium in Two-Person Zero-Sum Games
We propose an iterated version of Nesterov’s first-order smoothing method for the two-person zero-sum game equilibrium problem $\min_{x \in Q_1} \max_{y \in Q_2} x^{\mathsf{T}} A y = \max_{y \in Q_2} \min_{x \in Q_1} x^{\mathsf{T}} A y$. This formulation applies to matrix games as well as sequential games. Our new algorithmic scheme computes an $\epsilon$-equilibrium to this min-max problem in $O(\kappa(A) \ln(1/\epsilon))$ first-order iterations, where $\kappa(A)$ is a certain condition m...
A TRANSITION FROM TWO-PERSON ZERO-SUM GAMES TO COOPERATIVE GAMES WITH FUZZY PAYOFFS
In this paper, we deal with games with fuzzy payoffs. We prove that players who are playing a zero-sum game with fuzzy payoffs against Nature are able to increase their joint payoff, and hence their individual payoffs, by cooperating. It is shown that a cooperative game with the fuzzy characteristic function can be constructed via the optimal game values of the zero-sum games with fuzzy payoff...
Vehicle Routing Problem in Competitive Environment: Two-Person Nonzero Sum Game Approach
The vehicle routing problem (VRP) is one of the most important issues in transportation. Among VRP variants, the competitive VRP is especially important because there is tough competition between distributors and retailers. In this study we introduce a new method for the VRP in a competitive environment. In this method, two-person nonzero-sum games are defined to choose an equilibrium solution. Therefore, revenue giv...
Distribution Design of Two Rival Decenteralized Supply Chains: a Two-person Nonzero Sum Game Theory Approach
We consider competition between two decentralized supply chain networks under demand uncertainty. Each chain consists of one risk-averse manufacturer and a group of risk-averse retailers. The two chains offer substitutable products to geographically dispersed markets. The markets’ demands are contingent upon the prices, service levels, and advertising efforts of the two supply chains. We formulat...
A simple and numerically stable primal-dual algorithm for computing Nash-equilibria in sequential games with incomplete information
We present a simple primal-dual algorithm for computing approximate Nash equilibria in two-person zero-sum sequential games with incomplete information and perfect recall (like Texas Hold’em poker). Our algorithm only performs basic iterations (i.e., matvec multiplications, clipping, etc., with no calls to external first-order oracles, no matrix inversions, etc.) and is applicable to a broad class...
Journal:
- Math. Program.
Volume 133 Issue
Pages -
Publication date 2008